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METHOD FOR THE IDENTIFICATION OF MICROORGANISMS 
BY THE UTILIZATION OF DIRECTED AND ARBITRARY 
5 DNA AMPLIFICATION 

FJEED OF THE INVENTION 
The present invention relates to the field of 
microbial identification using DNA amplification. More 
particularly, the present invention relates to the 
10 identification of species of prokaryotic organisms , 
based on a characterization of products generated by 
amplification of specific genomic regions significant to 
the organism. In addition by amplification of arbitrary 
sections of the genome in conjunction with the 
15 _ amplification of specific genomic regions, 

identification of species, serotype and strain of the 
microorganism is achieved. 

PfrCKGRQPNP OF TffE INVENTION 
A variety of literature and references have been 
20 recently published pertaining to the identification of 
microorganisms such as bacteria in suspect samples. In 
these materials much emphasis has been placed on 
developing procedures to identify or classify 
microorganisms using a presumptive basis. That is, the 
25 researcher employs the technology to screen for a 

particular microorganism or pathogen of interest. Still 
other references require laborious screening with a 
variety of specific primers or incorporate varieties and 
carefully controlled operating constraints. As will 
* 30 become apparent from the ensuing discussion, the present 
invention obviates these disadvantages. 

PCT Publication WO89/10414 to Wallace et al. 
discloses a process for the creation of new genetic 
markers and the determination of multiple genetic 
35 markers. A genomic DNA sample is amplified, and the 
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sequence variation within the amplified fragment is 
detected . The amplified fragments are then detected. 
This publication teaches a general concept wherein a 
polynucleotide target sequence including a plurality of 
5 polymorphic loci are amplified, with the product 

analyzed for the presence or absence of target sequences 
having the locus specific characteristic. However r ' 
Wallace et al. do not focus on the amplif ication of 
spacer regions interspersed between highly conserved 
10 rDNA sequences for detecting microorganisms. The 

reference does not address this approach either alone or 
together with arbitrary regions from the entire genome. 

EP 0 332 435 is directed to a method of detecting 
one or more variant nucleotide sequences; The nucleic 
15 acid sample is contacted with a diagnostic primer 

substantially complementary to a diagnostic portion of a 
target base sequence. Extension only occurs where a 
terminal nucleotide of the -primer is complementary to a 
variant or normal nucleotide of the target base 
20 sequence. The extension product if any is detected. 
This approach does not target the spacer regions 
interspersed between conserved rDNA sequences for 
amplification and subsequent analysis, as presently 
cont enqplat ed • 

25 Williams et al., "DNA -polymorphisms amplified by 

arbitrary primers are useful as genetic markers", 
Nucleic Acid Research, Vol. 18, No. 22 p. 6531-6535 and 
Welsh et al., "Fingerprinting genomes using PCR with 
arbitrary primers", Nucleic Acid Research, Vol. 18, No. 

30 24 p. 7213-7218, both demonstrate the use of arbitrary 
primers in a DNA amplification reaction to generate a 
characteristic pattern of amplification products from 
genomic DNA from a variety of sources including 
bacteria. The approach used by Williams et «1. employs 
35 small primers run at relatively high stringency 
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conditions* The polymorphisms are called Random 
Amplified Polymorphic DNA (or RAPD) markers, and are 
useful to construct genetic maps in a variety of 
species. Welsh et al. describe the use of longer 
5 primers in an amplification process which uses low 
stringency conditions in the early cycles followed by 
higher stringency conditions in the later cycles. In 
both cases, bacterial strains are identified by 
comparison of the arbitrarily primed genomic print with 
10 predetermined reference patterns. 

The procedures described by Williams et al. and 
Welsh et al. both employ the use of arbitrary primers 
with a relatively high number of amplification cycles. 
Significant levels of formation of secondary 
15 amplification products and nonspecific DNA synthesis 
frequently result from amplifications run under these 
conditions. These amplification products are separated 
by electrophoresis and the resulting pattern is then 
analyzed and processed by pattern recognition software, 
20 Amplification with a single arbitrary primer, such 

as is described by Williams et al. and Welsh et al. may 
yield an arbitrary product pattern which possesses both 
common elements at the level of species and 
differentiated elements at the level of strain for a 
25 given species. However, a single arbitrary primer or 
pair of arbitrary primers may not demonstrate this 
property across a broad spectrum of microorganisms . 
Hence, different species of microorganisms may require a 
diverse menu of arbitrary primers to achieve common 
30 pattern elements at the level of species and 

differentiated elements at the level of strain. The 
present invention represents an improvement over the 
procedures described by Williams et al. and by Welsh et 
al. in that it does not require screening unknown 
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microorganisms with a battery of possible primers or 
making a presumptive preliminary identification. 

French Patent No. 2, 63 6, 075 concerns the detection 
of bacteria/ yeasts , parasites and other eukaryotic 
5 microorganisms in, for example, food products . 

Ribosomal RNA if any is extracted from the product, and 
transcribed to DNA in the presence of reverse 
transcriptase. The DNA strand is transcribed -to a 
complementary strand in the presence of primers, and 
10 then amplified in the presence of one or more primers 

which define a known detectable sequence. ' The amplified 
sequence is detected by electrophoresis iit an acrylamide 
gel or by hybridization using probes which cover in part 
the amplified region. 
15 Vilgalys et al., "Rapid genetic identification and 

mapping of enzymatically amplified ribosomal DNA from 
several Crypt ococcus species", Journal of Bacteriology, 
Aug. 1990, p. 4238-4246, describe a simplified procedure 
for performing the restriction typing and mapping 
20 necessary to identify related species of pathogenic 
fungi. An approach which uses a polymerase clxain 
reaction (as below) to amplify a known region of 
ribosomal DNA followed by a restriction digest and 
elect rophoretic separation is discussed. 
25 The identification procedures described in French m 

Patent No. 2,636,075 and by Vilgalys et.al., both employ 
multiple enzymatic processing steps which require 
purification of the intermediate products. BOtb. the 
multiplicity of enzymatic steps and the necessity for 
30 intermediate product isolation make these methods 
expensive and labor intensive. 

Mullis et al., U.S. Patent No. 4,683,195/ and 
Mullis, U.S. Patent No. 4 r 683/202 disclose polymerase 
chain reactions (referred to as the PGR procedure) which 
35 can be used to amplify any specific segment of a nucleic 
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acid. These analytical methods have been used to detect 
polymorphisms through amplification of selected target 
DNA segments from test genomes . A drawback to such 
methods is the requirement that a sufficient number of 
5 bases at both ends of the specific segment be known in 
sufficient detail so that two oligonucleotide primers 
can be designed which will hybridize to different 
strands, of the target segment. It is labor intensive to 
obtain the necessary sequence information from target 
10 genomes in order to design the necessary primers. 

It is an object of the present invention to provide 
a method for microbial identification at the level of 
genus and species by the detection of variations in 
length and number of fragments located between highly 
15 conserved rDNA sequences. These fragments are referred 
to as Ribosomal Sequence (RS) fragments. It is a 
further object of the present invention to provide a 
method for microbial identification using the preferred 
PGR procedures. It is a further object of the present 
20 invention that only a single pair of nucleic acid 

sequences will be required for the amplification which 
provides the basis for the method of microbial 
identification. To achieve this end, the variations in 
the aforementioned RS fragments are characterized by 
25 amplification of variable sequences (as spacer regions) 
interspersed between regions known to contain highly 
conserved sequences (as rDNA sequences) . These DNA 
sequences may flank the variable sequences or may 
otherwise be located in relation to the variable 
30 sequences. It is yet a further object of the present 
invention that the identification is extended to the 
level of serotype and strain through modifications which 
facilitate the amplification of additional arbitrary 
regions of the microbial genome in conjunction with the 
35 amplification of the variable sequences interspersed 
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between the highly conserved rDNA sequences. A feature 
of the present invention is its compatibility with 
various other techniques common to microbiological and 
molecular biological efforts , including but not limited 
5 to electrophoresis and staining (as by ethidium bromide) 
or spectrophotometric procedures. 

It is an advantage of the -present invention that 
the same pair of primers are used for all species of 
microorganism to generate amplification products. Since 
10 the sequences of these primers are highly conserved 
among prokaryotic organisms these primers are 
generically applied. Amplification from this single 
pair of primers generates products from conserved 
sequences in a known genetic locus which are 
15 characteristic of a given species. Additional products 
are generated by the arbitrary amplification events and 
are used to differentiate strains within a species. A 
substantial savings of time and expense is realized 
because the necessity for screening or presumptive 
20 identification has been eliminated. 

It is another advantage of the present invention 
that use of the method recited herein yields a pattern 
upon electrophoretic separation of the reaction 
products , which contains a minimal amount of secondary 
25 and nonspecific amplification products. These patterns 
are more amenable to analysis and processing by the 
types of pattern recognition software which are used for 
pattern comparison with a reference data base such as is 
described in U.S. Patent No. 4 r 885,697. 
30 It is yet another advantage of the present 

invention that the method recited herein requires only a 
single enzymatic processing step with no subsequent 
sample purification prior to electrophoretic separation. 
As such, the method is not labor intensive and results 
35 in a significant savings of time. 
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These and other objects,, features, and advantages 
of the present invention will become more readily 
understood upon having reference to the following 
description of the invention. 

5 SUMMARY OF THE INVENTION 

Described herein is a method for the identification 
of the species of a microorganism. This method 
comprises : 

(a) isolating genomic DNA from the 
10 microorganism, said genomic DNA comprising variable 
sequences interspersed between highly conserved 
sequences; 

<b) amplifying said variable sequences 
thereby producing fragments having particular 
15 distributions in size and number; and 

(c) separating said fragments by size and 
analyzing the distribution of said fragments thereby 
determining the species of the microorganism. 

Further described herein is a method for the 
20 identification of the species serotype and strain of a 
microorganism comprising: 

(a) isolating genomic DNA from the 
microorganism, said genomic DNA comprising variable 
sequences interspersed between highly conserved 

25 sequences in addition to arbitrary regions along the 
genome thereof; 

(b) amplifying said variable sequences 
thereby producing fragments having particular 
distributions in size and number; 

30 (c) amplifying said arbitrary regions thereby 

producing Arbitrary Genomic (AG) fragments having 
particular distributions in size and number 
simultaneously with (b) ; and 

(d) separating said fragments and said AG 

35 fragments by size and analyzing the distribution of said 
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fragments and said AG fragments thereby determining the 
species, serotype and strain of the microorganism. 

These methods are particularly useful for 
determining species, serotype and strain of bacteria, 
5 such as are resident as food borne microorganisms. 

In the preferred embodiment, the highly conserved 
sequences are highly conserved' rDNA sequences. Hence 
the fragments are RS fragments. 

According to a preferred embodiment of the methods 
10 disclosed herein, step (a) isolation comprises cellular 
lysis and deproteinization. Further , step (b) 
amplification may preferably comprise annealing at least 
one pair of oligonucleotide primers to the highly 
conserved rDNA sequences. A nucleic: acid polymerase and 
15 nucleotide triphosphates are introduced to the primers, 
suitable to amplify the variable sequences interspersed 
between the highly conserved rDNA sequences. 
Alternatively the oligonucleotide primers are annealed 
to the aforementioned sequences and to a plurality of 
20 arbitrary regions, ensuring suitable amplification of 
the arbitrary regions-. 

DETAILED DESCRIPTION OF THE FIGURES 
Figure 1 is a generalized schematic of the rRNA 
genetic locus of bacteria. This locus contains the 
25 conserved sequence regions where the priming for the 

amplification occurs and tbe spacer region between these 
priming sites. 

Figure 2 is the visualized pattern for the 
amplification products of the spacer region in the rRNA 
30 genetic locus for four species from the genus Listeria 
including Listeria monocytogenes. 

Figure 3 is the visualized pattern for the 
amplification products of the spacer region in the rRNA 
genetic locus for four serotypes from the genus 
35 Salmonella including Salmonella typhimurim. 
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Figure 4 is the visualized pattern for the 
amplification products of the spacer region in the rRNA 
genetic locus for five species from the genus 
Staphylococcus including Staphylococcus aureus. 
5 Figure 5 is the visualized pattern for the 

amplification products of the spacer region in the rRNA 
genetic locus for five species" from the genus 
Escherichia including Escherichia coli. 

Figure 6 is the visualized pattern for the 
10 amplification products of the spacer region in the rRNA 
genetic locus for eight additional species taken from 
four genera which are related to the pathogenic species 
of interest. These species are as follows: Citrobacter 
freundii, Citrobacter diversus, Enterobacter aerogenes, 
15 Enterobacter agglomerans, Enterobacter cloacae, Proteus 
mlrabilis, Proteus vulgaris and Yersinia enterocolitica. 

Figure 7 is the visualized pattern for the 
amplification products of both the spacer region in the 
rRNA genetic locus and the arbitrary regions from the 
20 entire microorganism genome for three species from the 
genus Listeria including Listeria monocytogenes. 

Figure 8 is the visualized pattern for the 
amplif ication products of both the spacer region in the 
rRNA genetic locus and the arbitrary regions from the 
25 entire microorganism genome for three serotypes from the 
genus Salmonella including Salmonella typhimurim. 

Figure 9 is the visualized pattern for the 
ampl if ication products of both the spacer region in the 
rRNA genetic locus and the arbitrary regions from the 
30 entire microorganism genome for three species from the 
genus Staphylococcus including Staphylococcus aureus. 

Figure 10 is the visualized pattern for the 
amplification products of both the spacer region in the 
rRNA genetic locus and arbitrary regions from the entire 
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microorganism genome for four strains of Escherichia 
coli and two strains of Citrobact&r freundii. 

PETMLEP DESCRIPTION QF THE JNVWION 
The method described herein is useful in 
5 identifying a wide variety of microorganisms. 

Representative but not exhaustive of the many types of 
organisms including both genus , species and serotype 
that may be elicited through the use of the present 
procedures are Listeria monocytogenes , Listeria 
10 welshimerl, Listeria innocua, Listeria ivanovii, 
Salmonella typhlmurium, Salmonella enteritidis, 
Salmonella newport, Salmonella infantis, Staphylococcus 
aureus. Staphylococcus scuiri, Staphylococcus warneri, 
Staphylococcus saprophyticus t Staphylococcus 
15 epidermidusr Escherichia coli, Escherichia fergusonii, 
Escherichia blattae, Escherichia hermanli, Escherichia 
vulneris, Citrohacter freundii, Citrobacter diversus, 
Enterobacter aerogenes, Enterobacter agglomerans, 
Enterobacter cloacae, Proteus mirabills, Proteus 
20 vulgaris, and Yersinia enterocolitica • Such a listing 
may form a database of previously visualized products 
which when compared to the electrophoresed, visualized 
fragment products according to the present method, 
afford an identification of the species (and the 
25 serotype and strain if applicable) . 

It is readily appreciated by one skilled in the art 
that the present method may be applied to microorganisms 
in the context of a wide variety of circumstances . 
Thus, a preferred use of the present invention is in the 
30 identification of microorganisms in food* Additionally, 
research directed to microbial infections in humans 
other animals and plants would benefit from the 
procedure herein. 

The present invention relates to the identification 
35 of species of prokaryotic organisms/ based on a 
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characterization of products generated by amplification 
of the spacer region found between the 16S and 23S 
regions of the rRNA genetic locus. The rRNA genetic 
locus is a genetic unit which is found in prokaryotic 
5 cells. Significant portions of the nucleic acid 

sequence which make up this genetic locus are common to 
all prokaryotic organisms. (Figure 1 showns a 
generalized schematic of this locus.) The overall 
relatedness of the 16S, 23S, and 5S regions of this 

10 genetic locus has been used as a tool to classify 

.differing species of prokaryotes. Genes coding for the 
16S ribosomal RNA region have been shown to contain 
regions of highly conserved sequences which are 
interspersed between areas of less conserved sequences. 

15 Although less well characterized than the 16S region the 
23S ribosomal RNA region has also been shown to have a 
similar makeup. 

The 16S, 23S and 5S gene sequences are separated by 
spacer units. These spacer units (also referred to as 

20 spacer regions) exhibit a large degree of sequence and 
length variation at the levels of genus and species for 
prokaryotic organisms. Within a single genome of a 
given species there are frequently multiple rRNA genetic 
loci present. The spacer regions found within these 

25 loci also show a significant degree of variation in 

length and sequence. It has been shown that conserved 
sequences of the 16S region can be used as sites for 
amplification of nonconserved sequences within the 16S 
region for the purpose of determining their nucleotide 

30 composition. Similarly, it is reasonable to expect that 
conserved regions on the 3' and 5* ends of the 16S and 
23S rDNA, respectively, can be used as priming sites for 
the amplification of the the spacer unit contained 
between these two conserved regions. It is expected 

35 that amplification of such spacer regions will produce 
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fragments whose size and number is characteristic of a 
given species of prokaryote. 

The 16S regions of rRNA genetic loci have been 
sequenced for a broad range of bacteria. A compilation 
5 of this sequence data is given in Dams et aL, 

"Compilation of small ribosomal subunit RNA sequences", 
Nucleic Acids Research, Vol, 16 r supplement r p. r87* 
This sequence data was examined for the purpose of 
identifying a highly conserved sequence in the 16S 
10 region immediately adjacent to the spacer region. The 

sequence which was chosen was GAAGTCGTAACAAGG which will 
be designated as PI. This sequence lies in a highly 
conserved region approximately 30-40 bases away from the 
spacer region. 
IS There is substantially less sequencing data 

available for the 23S region. Gutell et al., "A 
compilation of large subunit RNA sequences presented in 
a structural format n / Nucleic Acids Research/. Vol. 16 r 
supplement, p. rl75 present a small collection of 23S 
20 sequences consisting of five bacterial and four plant 
chloroplast sequences- The second primer- chosen 
represented the most conserved sequence among the 
bacterial sequences presented which could be found 
immediately following the spacer region. The sequence 
25 which was chosen was CAAGGCATCCACCGT which will be 

designated P2. This sequence was conserved among the 
five bacterial examples cited and was located 
approximately 20 bases away from the spacer region. The 
fact that this sequence also matches the plant 
30 chloroplast sequence in the same region for thirteen of 
fifteen bases (87% similar) gives additional reason to 
expect that this particular sequence is evolutionarily 
stable and likely to be conserved over a broad spectrum 
of bacteria. 
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Primers for both the 16S and 23S regions were 
limited to seventeen bases in length (and it is 
preferable to use 15-17 bases) because longer primers 
would extend into regions of more poorly conserved 
5 sequence. Primers which overlap regions that are not 
conserved among all bacteria are expected to show more 
variation in amplification efficiency. This would make 
it more difficult to specify a single set of primers and 
amplification conditions for all bacterial genomic DNA 

10 samples. It is possible that some sequences which are 
not ribosomal spacers may be amplified by the primer set 
of PI and P2. This type of occurrence is expected to be 
rare but amplification of nonspacer sequences even under 
highly stringent conditions can not be ruled out. Such 

15 an amplification does not detract from the utility of 
the method and will likely provide additional pattern 
information for the differentiation of microorganisms 
below the species level. Any of the amplified fragments 
produced by the high stringency amplification with 

20 primers PI and P2 will be referred to as RS fragments. 
This name serves to indicate that the sequences for 
these primers was taken from highly conserved rDNA 
sequences flanking a ribosomal spacer region. 
It is frequently necessary to carry the 

25 identification of pathogenic species beyond the level of 
species to serotype and strain. This information is 
valuable, for example, in identifying the source of food 
• contamination and in tracing the course of epidemic 
infections. Although the patterns generated from the 

30 amplification of the spacer regions in the rRNA genetic 
locus contain sufficient information to identify 
species, they do not always contain sufficient 
information to carry the identification to the level of 
serotype and strain. Ribosomal RNA genes constitute 

35 only about 1% of the total prokaryotic genome. In order 
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to identify species at the serotype and strain level, we 
need to gather additional information relating to the 
genomic composition of those areas located outside the 
rRNA genes. 

5 A procedure for generating arbitrary amplification 

fragments from genomic material has. been demonstrated by 
Williams et al. The approach describes the use of a 
small oligonucleotide, i.e., approximately 10 bases, of 
arbitrary composition in a DNA amplification reaction,* 
10 Short primers are used in order that complementary and 
reverse complementary sequences to the primer can be 
found at distances along the genome which are 
sufficiently small that DNA amplification * can take 
place. The fragments generated in the amplification 
15 process are called Random Amplified Polymorphic DNA RAPD 
markers. These RAPD markers show a size distribution 
which is sensitive to modest differences in the genomic 
makeup of the DNA used in the amplification process. 

In the present invention we provide a method for 
20 the generation of arbitrary amplification fragments from 
the genome of the target microorganism in conjunction 
with the generation of amplification fragments from the 
spacer regions in the rRNA genetic locus. Amplification 
of the putative rDNA spacer regions is achieved using 
25 15-17 base primers and annealing temperatures of up to 
about 72°C (and preferably up to about 55°C) for a 

duration of about 1-20 (preferably about 2-10) minutes 
in the amplification reaction. Differentiation of 
species at the serotype and strain level is achieved by 

30 reducing the size of the primers used in the 

amplification, while still maintaining the conserved 
element of their sequence. By decreasing the size of 
the primers to 11 bases in length (and it is preferable 
to use 10-12 bases) and reducing the annealing 

35 temperature to 43-4 6°C for a duration of about 1-10 
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minutes (preferably about 5-10 minutes) , it is possible 
to generate both RS fragments and RAPD markers from the 
genomic DNA using a single pair of primers . Since the 
RAPD markers are generated by amplification of arbitrary 
5 genomic sequences they will be referred to as AG 
fragments. The combined amplification of RS and AG 
fragments will be referred to as RS/AG amplification. 
The RS fragments which are produced in the amplification 
reaction are sufficiently distinct to permit 
10 differentiation of microorganisms at the level of 
•species. The additional AG fragments which are 
generated provide sufficient additional diversity to 
differentiate microorganisms at the serotype and strain 
level . 

15 It is an important feature of this invention that 

each RS/AG amplification pattern contains RS fragments 
which can be used to identify species . This means that 
unique strains of a given species will contain common 
pattern elements which will indicate that the strains 

20 are derived from a common species. When a strain is 
encountered which is not represented in the reference 
pattern database, the species of the strain can still be 
identified providing that the RS fragments for that 
species are contained in the reference pattern database. 

25 In order to perserve the conserved nature of the 

primers used to generate the RS/AG pattern the 11-base 
primers used in the amplification reaction are a subset 
of the 15-base primer sequences. The sequence which was 
chosen for the 16S area adjacent to the spacer was 
30 GAAGTCGTAAC, which will be designated as P3. The 

sequence which was chosen for the 23S area following the 
spacer region was AAGGCATCC AC , which will be designated 
P4. 

A significant degree of intramolecular 
35 hybridization is known to occur within the rDNA genetic 
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locus. The resulting secondary structure frequently 
makes it difficult for amplification primers to compete 
for hybridization sites. In order to enhance the 
amplification of fragments contained within the rDNA 
5 region it is necessary to modify the amplification 

temperature profile which is typically practiced. The 
principal modifications consist of the use of 
substantially longer annealing times. Seven minutes is 
used for PI and P2 for the generation of the RS 
10 fragments. Eight minutes is used with P3 and P4 to 

generate both RS and AG fragments in a single reaction. 
In both cases , amplification reactions are being run 
under high stringency conditions in conjunction with a 
decreased number of amplification cycles. A high 
15 stringency amplification is accomplished by running the 
reaction at the highest annealing temperature where 
products are reproducibly formed. For the primer set PI 
and P2 the maximum temperature which yielded 
reproducible product formation was 55°C. For the primer 
20 set P3 and P4, the maximum temperature which yielded 

reproducible product formation was 43°C. Use of maximum 
annealing temperature insures that only the most stable 
hybridization structures will form and that the areas 
surrounding the priming sites will possess a minimal 
25 amount of secondary structure* 

There are two reasons for the use of a long 
annealing time in the amplification reaction. First, 
the polymerase has sufficient time to find the majority 
of primer-hybridized sites, resulting in an increase in 
30 the overall efficiency of each amplification cycle. 
Second, initiation sites for amplification will be 
governed by which annealing sites form the most stable 
primer-duplex rather than by which site has the greatest 
accessibility. This significantly improves the 
35 reproducibility of the pattern because the amplification 
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sites are competing based on primer hybridization 
equilibria rather than on kinetic accessibility. Since 
relative equilibria are far less sensitive to small 
variations in temperature and ionic strength than are 
5 the relative kinetics of accessibility to a specific DNA 
site, the integrity of the resulting pattern is less 
sensitive to small variations of ionic stength and 
temperature. The increased efficiency of the 
amplification process makes it possible to generate a 
10 reproducible and easily detectable pattern within 25 
amplification cycles for the RS amplification and 28 
cycles for the RS/AG amplification. 

The arbitrary priming procedures reported by 
Williams et al. and Welsh et al. both require a large 
15 number of amplification cycles, 4 5 and 32-42, 
respectively. Use of such a high number of 
amplification cycles frequently results in the formation 
of secondary amplif ication products and nonspecific DNA 
synthesis. A product profile background which contains 
20 high levels of such secondary amplification products and 
nonspecific DNA can severely restrict the ability of 
pattern recognition software to compare such a product 
profile with a known database. Use of fewer 
amplification cycles with 7-8 minute annealing times 
25 produces a far less complex product profile with a 

significantly reduced nonspecific DNA background. When 
the products from the RS or RS/AG amplifications are 
separated by electrophoresis the resulting pattern can 
be easily analyzed and processed by the types of pattern 
30 recognition software which are used for comparisons with 
a reference database. 

GENIAL SBQCEBIZBES 

The bacterial cells are pelleted and then 
35 resuspended in lysis buffer (10 mM tris-HCl, 10 mM NaCI, 
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50 mM EDTA at pH 8.0) . The cells are subjected to an 
enzymatic digest comprised of 25 U Lysostaphin, 30 ug 
N-Acetylmuramidase, 400 ug Achromopeptidase, and 600 ug 
Lysozyme in a final volume of 306 ul of lysis buffer. 
5 The digestion is carried out for 30-45 minutes at 37°C. 
Lysates are extracted with chloroform/phenol, then 
ethanol precipitated. The DNA* Is redissolved in a 
tr is /EDTA buffer and the final concentration is 
determined by spectrophotometry measurement. 
10 Amplification of PNA 

An amplification reaction is generally described in 
Mullis et al., U.S. Patent 4 r 683 r 195 and Mullis, U.S. 
Patent 4,965/188, which patents are incorporated by 
reference. U.S. Patent 4,683,195 is relevant for its 
15 teaching of the basic PCR approach, and U.S* Patent 

4,965,188 for its teaching of thermophilic polymerases. 
Specific conditions for amplifying a nucleic acid 
template are described in M» A. Innis and D. H. <3elfand, 
W PCR Protocols, A Guide to Methods and Applications", 
20 M. A. Innis, D. H. Gelfand, J.-J. Sninsky and T. J. 
Whiter eds. pp. 3-12, Academic Press (1989), which is 
incorporated by reference. 

Specifically, Mullis is directed to a process for 
amplifying any desired specific nucleic acid sequence 
25 contained in a nucleic acid or mixture thereof. The 
process of Mullis comprises treating separate 
complementary strands of the nucleic acid with a molar 
excess of two oligonucleotide primers, and extending the 
primers to form complementary primer extension products 
30 which act as templates for synthesizing the desired 
nucleic acid sequence. The primers of Mullis are 
designed to be sufficiently complementary to different 
strands of each specific sequence to be amplified. The 
steps of the reaction may be carried out stepwise or 
35 simultaneously and can be repeated as often as desired 
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(amplification is to be performed at least twice for the 
present invention) • 

In the context of the present invention, genomic 
DNA is isolated by cellular lysis and deproteinization . 
5 Amplification of the RS fragments (and arbitrary genomic 
regions if applicable) comprises annealing at least one 
pair of oligonucleotide primers to the highly conserved 
rDNA sequences (and to a plurality of arbitrary regions 
if applicable) . In this procedure a nucleic acid 

10 polymerase and nucleotide triphosphates are introduced 
to the primers. The procedure is carried out under 
conditions suitable to amplify the RS fragments (and the 
arbitrary regions if applicable) . 

Preferred nucleic acid polymerases are DNA 

15 polymerases, particularly those which are thermostable. 
Preferred nucleoside triphosphates include deoxyribo- 
nucleoside triphosphates. 

To facilitate identification of a microorganism 
from a pattern of araplif ication products generated from 

20 the genome of that organism, the pattern should contain 
elements common to all members of the species in 
conjunction with elements which differentiate the unique 
strains within that species. This attribute makes it 
possible to develop a species identification of a new 

25 strain, which is not contained within the reference 

pattern database, providing that the common elements of 
that species pattern are contained within the reference 
database. If different strains of the same species each 
generate completely unique patterns with no common 

30 species elements then the patterns themselves will not 
identify the species of an unknown strain. 

Amplification of RS Fracrments 
DNA samples are diluted to a concentration of 20 
ng/ul prior to amplification. A 1.25 ul aliquot of the 

35 bacterial genomic DNA is combined with 2.5 ul of 10X 
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reaction buffer , 1 ul of a dNTP mixture (5 mM ea.) r 
1,25 ul each of two 15-base oligonucleotide primers (PI 
and P2 at 50 ng/ul) and 42 ul of deionized water, (The 
primers are obtained from Research Genetics and are used 
5 without further purification. Other primers from other 
sources performing the same function may be used.) This 
mixture is heated to 94°C for 5 minutes and 1*3 units of • 
a thermophilic DNA polymerase is added to the mixture. 
Acceptable polymerases include Tag Polymerase, a 
10 thermostable DNA polymerase isolated from Thermus 
Acfuaticus YT1 according to a purification procedure 
developed by Cetus Corporation/ and available from 
PerJcin Elmer Cetus, Norwalk, CT. Twenty-five 
amplification cycles are performed on an automated 
15 thermocycler. The cycle format is as follows': 1 minute 
at 94°C (denaturing step) ; 2 minute ramp to 55°C; 7 
minutes at 55°C (annealing step) ; 2 minute ramp to 72°C 
and 2 minutes at 72°C (synthesis step) . The final cycle 
is followed by an additional 7 minutes at 72°C to allow 
20 partial polymerizations to run to completion, At the 
end of the cycling program 1 ul of EDTA (0.5M) is added 
as a stop solution and the products ar^ stored at 4°C. 
ftropliff itttivn of ft? figments ?tn<fl 
arbitrary genomic regions ffig/flg) 
25 DNA samples are diluted to a concentration of 20 

ng/ul prior to amplification. A 2-5 ul aliquot of the 
bacterial genomic DNA is combined with 5 ul of 10X 
reaction buffer, 2 ul of a dNTP mixture (5 mM ea,) f 
2.5 ul each of two 11-base oligonucleotide primers {P3 
30 and P4 at 50 ng/ul) and 34 ul of deionized water. (The 
primers axe obtained from Research Genetics ?tnd are used 
without further purification.) This mixture is heated 
to 94°C for 5 minutes and 2.0 units of a thermophilic 
DNA polymerase is added to the mixture. Acceptable 
35 polymerases include Tag Polymerase/ a thermostable DNA 
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polymerase isolated from Thermus ftgygUcus xTl according 
to a purification procedure developed by Cetus 
Corporation, and available from Perkin Elmer Cetus, 
Norwalk, CT. Twenty-eight amplification cycles are 
5 performed on an automated thermocycler . The cycle 

format is as follows: 0.5 minutes at 94°C (denaturing 
step); 2.75 minute ramp to 43°C; 8 minutes at 43°C 
(annealing step); 3.5 minute ramp to 72°C and 2 minutes 
at 72 P C (synthesis step). The final cycle is followed 

10 by an additional 7 minutes at 72°C to allow partial 

polymerizations to run to completion. At the end of the 
cycling program 1 ul of EDTA (0.5M) is added as a stop 
solution and the products are stored at 4°C. 

>.l»r;i-r«phor ft»H.s of Anrol if ication Product 

15 A 5 ul aliquot of the reaction mixture is removed 

and combined with 2 ul of loading buffer (15% Ficol, and 
0.25% xylene cyanol) . The mixture is loaded onto a 4% 
acrylamide/bis gel (29/1) and separated by 
electrophoresis. The gels are stained with ethidium 

20 bromide and are photographed on a UV transilluminator . 
Other techniques for visualizing the RS fragments and 
the AG fragments may be selected. One such popular 
alternative is the incorporation of fluorescence-labeled 
deoxynucleotides during amplification. Another 

25 alternative is spectroscopy. 

EXAMPLES 

Figures 2-6 were generated by carrying out 
amplification reactions using the procedure described in 
the section "Amplification of rDNA spacer regions" on 

30 genomic DNA samples from the specified bacteria. 
Figures 7-10 were generated by carrying out 
amplification reactions using the procedure described in 
the section "Amplification of rDNA spacer regions and 
arbitrary genomic regions" on genomic DNA samples from 

35 . the specified bacteria . Amplification products were 
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separated on an acrylamide gel and stained with ethidium 
bromide . 

Bacteria representing eight different genera were 
chosen for amplification of RS fragments. Multiple 
5 species from the genera Listeria, and Staphylococcus, 
and multiple serotypes from the genus Salmonella were 
chosen since these represent a "significant number of 
pathogenic microorganisms whose identification ' and 
characterization is particularly important. Five 
10 species were examined from the genus Escherichia and 
eight additional species taken were from four genera 
which axe related to the pathogenic species of interest . 
Examples of the fragment size profiles produced in 
amplification reactions for these bacteria and resolved 
15 by electrophoresis are shown in Figures 2 through 6. 

Bacteria representing five different genera were 
chosen for the combined amplification of the RS and AG 
fragments (RS/AG) . Multiple serotypes and strains were 
chosen from the genera Listeria, and Staphylococci! S. 
20 Multiple serotypes were chosen from the genus 

Salmonella* Four strains were examined from the genus 
Escherichia and two strains taken from Citrobacter 
freundii. Examples of the fragment size profiles 
produced in amplification reactions for these bacteria 
25 and resolved by electrophoresis are shown in Figures 7 
through 10. The size fragments which are common to both 
the RS and the RS/AG amplif ications for each strain are 
indicated by arrows in the Figures 7-10. 

The lanes containing amplifications of bacterial 
30 DNA are numbered in each figure and the strain number 
corresponding to that lane is given in the Figure 
legend. The unnumbered lanes contain time markers which 
are used to assign sizes to the amplification products. 
The sizes of the time markers are as follows: 220, 412, 
35 693, 1331, and 2306 base pairs (bp) . The sizes of the 
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fragments produced in the amplifications were calculated 
from the positions of these fragments relative to the 
time markers. The uncertainty in the calculated sizes 
of the amplification products is approximately 2%. 
5 EXAMPLE 1 

Figure 2 shows the pattern for the amplification of 
RS fragments for four species from the genus Listeria . 
Lanes 1-4 show the patterns for four strains of Listeria 
monocytogenes. All four strains of L. monocytogenes 

10 show the same pair of fragments at 355 and 625 bp. The 
breakdown of remaining species of Listeria is as 
follows: lanes 5 and 6, L. welshimeri with fragments at 
355 and 640 bp; lanes 7 and 8, L. innocua with fragments 
at 355 and 655 bp; lanes 9 and 10, L. ivanovii with 

15 fragments at 380 and 605 bp. There were no size 

fragments which were common to all species of Listeria. 

SX^ffLE 2 

Figure 3 shows the pattern for the amplification of 
RS fragments for four serotypes from the genus 

20 Salmonella. Patterns for four strains of Salmonella 

typhlmurim are shown in lanes 1-4. All four strains of 
S. typhlmurim show the same set of fragments at 480 , 
650 f and 675 bp. The breakdown of remaining serotypes 
of Salmonella is as follows: lanes 5 and 6, both 

25 strains of 5. enteritidis show common fragments at 480 , 
600/ and 650 bp; lanes 7 and 8/ both strains of S. 
newport show common fragments at 480/ 570/ and 610 bp 
and S. newport #707 also shows an additional fragment at 
510 bp; lanes 9 and 10/ both strains of S. infantis show 

30 common fragments at 495/ 560, 630 and 655 bp. All the 
serotypes of Salmonella shown contain a common fragment 
in the region of 480-4 95 bp. However, all of the 
serotypes tested also contain unique product size 
profiles . 
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Figure 4 shows the pattern for the amplification of 
RS fragments for five species from the genus 
Staphylococcus. Patterns for eight strains of 
5 Staphylococcus aureus are shown in lanes 1-8. The 
strains in lanes 1-3 all show a common pattern with 
fragments of 425 f 465, 565 r and 705 bp. The remaining 
five strains found in lanes 4-8 show a series of diverse 
patterns with only a few elements in common, e.g., 
10 fragments of 485, 570 and 620 bp can be found in three 
of the five strains. However, no other size of 
fragments are common to more than two of the five 
strains. The remaining species of Staphylococcus are 
shown in lanes 9-16. The breakdown of these species is 
15 as follows: lanes 9 and 10, both strains of S, scuiri 
show common fragments at 345, 445 and 535 bp and S. 
scuiri #868 also shows an additional band at 530 bp; 
lanes 11 and 12, 5. warneri with fragments at 455, 505, 
525 and 545 bp; lanes 13 and 14, S. saprophyticus with 
20 fragments at 470, 550, and 640 bp; lanes 15 and 16, both 
strains of S. epidermidus show common fragments at 390 
and 590 bp with S. epidermidus #796 showing an 
additional fragment at 520 bp and S* epidermidus #788 
showing two additional fragments at 440 and 490 bp. The 
25 intraspecies variation in the pattern of amplification 
of RS fragments is far greater for S* aureus than for 
the other species of Staphylococcus which were 
evaluated. This is an example of a case where the 
information in the pattern generated by the 
30 amplification of RS fragments can differentiate below 
the level of species. Other than Staphylococcus aureus 
all the other species of Staphylococcus were 
characterized by the production of a single 
characteristic fragment which was present at a far 
35 greater level than th other amplification products. 
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These characteristic fragments are as follows; S. scuiri 
at 345 bp, S. warneri at 505 bp, S. saprophytics at 470 
bp, and S. epidermidus at 390 bp. The intragenic 
variation in patterns found within the genus 
5 Staphylococcus is also significantly greater than the 
variation found in the genera of Salmonella and 
Listeria . 

EXAMPLE 4 

Figure 5 shows the pattern for the amplification of 
10 RS fragments for five species from the genus 

Escherichia. Lanes 1-4 show the patterns for four 
strains of Escherichia coli. All four strains of 
Escherichia coli show the same pair of fragments at 4 80 
and 530 bp. Single examples of four additional species 
15 of Escherichia are shown in lanes 5-8. None of these 
species appear to have any fragments in common with 
Escherichia coli although the Escherichia fergusonii in 
lane 6 does show a similar fragment size differential 
with fragments of 495 and 550 bp- There were no size 
20 fragments which were common to all species of 
Escherichia. 

EXAMPLE 5 

Figure 6 shows the pattern for the amplification of 
RS fragments for eight additional species taken from 

25 four genera which are related to the pathogenic species 
of interest. Lanes 1-4 show patterns for two species of 
Citrobacter. The two strains of C. freundii shown in 
lanes 1 and 2 both gave identical patterns with 
fragments of 315, 490, and 615 bp. The strains of C. 

30 diversus in lanes 3 and 4 also gave identical patterns 
with fragments of 4 60 and 615 bp. Lanes 5-10 show 
patterns for three species of JSnterobacter. The two 
strains of JET. aerogenes shown in lanes 5 and 6 yield 
similar patterns with a common fragment at 4 65 bp. The 

35 other two fragments are 280 and 590 bp for E . aerogenes 
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#62. For E. aerogenes #167 a similar pattern of 
fragments each shifted 10 bases longer is seen. The two 
strains of E. agglomerans in lanes 7 and 8 show 
identical patterns with fragments of 45 0, 475 and 580 
5 bp. In lanes 9 and 10 both strains of E+ cloacae show 
fragments at 310 ,465, and 580 bp. Lanes 11-14 show 
patterns for two species of Protevs. The two strains of . - 
P. mirabilis in lanes 11 and 12 show. identical* patterns 
with fragments at 505, 665 and 850 bp. In lanes 13 and 
10 14 both strains of P. vulgaris show fragments 590, 840/ 
930, 1Q70, and 1200 bp. Lanes 15 and 16 show patterns 
for two strains of Yersinia enterocolitica . Both 
strains show a pair of fragments at 745 and 815 bp. 

15 Figure 7 shows the pattern for the amplification of 

RS and AG fragments for three species from the genus 
Listeria. Lanes 1-4 show the patterns for four strains 
of Listeria monocytogenes. All four strains of L. 
monocytogenes show a common pair of fragments at 355 and 

20 625 bp which are the same as the RS fragments observed 
in Figure 2 for the same strains of Listeria 
monocytogenes. The remaining components in the pattern 
are the result of arbitrary genomic amplifications . 
These components distinquish three of the four L. 
25 monocytogenes strains. L. monocytogenes #891 and #899 
generate the same amplification products. The breakdown 
of remaining species of Listeria is as follows: In 
lanes 5 and 6 L. welshimeri show a common pair of 
fragments at 355 and 640 bp which are the same as the RS 
30 fragments observed in Figure 2 for the same strains of 
Lm welshimeri. The remaining components in the pattern 
are the result of arbitrary genomic amplifications and 
can be used to distinquish between the two strains of L> 
welshimeri. In lanes 7 and 8, L. innocua show a common 
35 pair of fragments at 355 and 655 bp which are the same 
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as the RS fragments observed in Figure 2 for the same 
strains of L. innocua. The remaining components in the 
pattern are the result of arbitrary genomic 
amplifications and can be used to distinguish between 

5 the two strains of L. innocua. 

EXAMPLE 7 

Figure 8 shows the pattern' for the amplification of 
RS and AG fragments for three serotypes from the genus 
Salmonella. Patterns for four strains of Salmonella 

10 typhimurim are shown in lanes 1-4. All four strains 

show a similar set of fragments at .480, 650, and 675 bp 
which are the same as the RS fragments observed in 
Figure 3 for the same strains of S. typhimurim. The 
remaining components in the pattern are the result of 

15 arbitrary genomic amplifications. These components 
distinguish S. typhimurim #590 from the other three 
strains. The breakdown of remaining serotypes of 
Salmonella is as follows: In lanes 5 and 6 both strains 
of S. infantis show common fragments at 490, 560, 630 

20 and 655 bp which are the same as the RS fragments 

observed in Figure 3 for the amplifications of the same 
strains. The additional fragments, which are the result 
of arbitrary genomic amplifications, show similar 
patterns for both strains of S. infantis. In lanes 7 

25 and 8, both strains of S. enteritidis show common 

fragments at 480, 600, and 650 bp which are the same as 
the RS fragments observed in Figure 3 for the 
amplifications of the same strains. The additional 
fragments, which are the result of arbitrary genomic 

30 amplifications can be used to discriminate between these 
strains. All of the RS/AG patterns generated for the 
genus Salmonella were highly related. However, the 
individual Salmonella serotypes can be distinguished 
based on their unique product size profiles in the RS/AG 

35 amplifications. 
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Figure 9 shows the pattern for the amplification of 
RS and AG fragments for three species from the genus 
Staphylococcus. Patterns for four strains of 
5 Staphylococcus aureus are shown in lanes 1-4. . The 

strain #807 in lane 1 shows fragments at 425, 465, 565 , 
and 705 bp which are the same as the RS fragments 
observed in Figure 4 for the same strain. The * remaining 
components in the pattern are the result of arbitrary 
10 genomic amplifications- In lanes 2 and 3 Staphylococcus m 
aureus #1098 and #1097 both .show common fragments at 
565, and 605 bp with #1098 showing an additional 
fragment at 505 bp and #1097 showing an additional 
fragment at 525 bp. All of these fragments are the same 
15 as the RS fragments observed in Figure 4 for the same 
strains. The remaining components in the pattern are 
the result of arbitrary genomic amplifications and can 
be used to more easily distinguish between these two 
strains of Staphylococcus aureus. The strain #795 in 
20 lane 4 shows fragments at 470, 515, 565, and 610 bp 
which are the same as the RS fragments observed in 
Figure 4 for the same strain. The remaining components 
in the pattern are the result of arbitrary genomic 
amplifications. The remaining species of Staphylococcus 
25 are shown in lanes 5-8. Both strains of 5. warnerl, 

#793 and #797, shown in lanes 5 and 6 exhibit fragments 
which correspond in size to the RS fragments observed 
for S. warner i in Fiqure 4* The additional fragments, 
which are the result of arbitrary genomic amplifications 
30 can be used to discriminate between these strains. Both 
strains of S. epidermldus, #788 and #796, shown in lanes 
7 and 8 exhibit fragments which correspond in size to 
the RS fragments observed in Fiqure 4 • The additional 
fragments, which are the result of arbitrary genomic 
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amplifications can be used to discriminate between these 
strains . 

Figure 10 shows the pattern for the amplification 
5 of RS and AG fragments for four strains from Escherichia 
coli. Lanes 1-4 show the patterns for four strains of 
Escherichia coli. All four strains of Escherichia coli 
show a pair of fragments at 480 and 530 bp which are the 
same as the RS fragments observed in Figure 5 for the 

10 amplifications of the same strains . The remaining 

components in the pattern are the result of arbitrary 
genomic amplifications. These components can 
'distinguish between all four strains of Escherichia 
coli. The two strains of Citrobacter freundii shown in 

15 lanes 5 and 6 both exhibit patterns which, contain 
fragments of 315, 490, and 615 bp. These fragments 
correspond in size to the fragments produced in the RS 
amplification of C. freundii shown in Figure 6. The 
additional fragments, which are the result of arbitrary 

20 genomic amplifications can be used to discriminate 
between these strains. 

It is to be understood that the processes of the 
present invention may be modified according to practices 
of those skilled in the art without departing from the 

25 spirit and the scope of the invention herein. 
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SEQUENCE LIS2XNG 

(1) GENERAL INFORMATION: 

(i) APPLICANT: JENSEN, MARK A 
STRAUS, NEIL A 

(ii) TITLE OF INVENTION: METHOD FOR THE IDENTIFICATION OF ♦ 
MICROORGANISMS BY THE UTILIZATION OF DIRECTED AND 
ARBITRARY DNA AMPLIFICATION 
(Hi) NUMBER OF SEQUENCES : 4 
<iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: E. I. DU PONT DE NEMOURS AND COMPANY 

(B) STREET: BARLEY MILL PLAZA 36/2122 

(C) CITY: WILMINGTON 

(D) STATE: DELAWARE 

(E) COUNTRY: USA 

(F) ZIP: 19880-0036 
(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 
<B) COMPUTER: Macintosh 

(C) OPERATING SYSTEM: Macintosh, 6 . 0 

(D) SOFTWARE: Microsoft, 4. 0 
(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: U.S. SER. NO. 07/803,302 

(B) FILING DATE: DECEMBER 4, 1991 

(C) CLASSIFICATION: 
(viii) ATTORNEY /AGENT INFORMATION : 

(A) NAME: HAMBY, WILLIAM H 
(C) REFERENCE/DOCKET NUMBER: MD-0101 
(ixj TELECOMMUNICATION INFORMATION r 

(A) TELEPHONE: 302-992-4930 

(B) TELEFAX: 302-892-7949 
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(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

GAAGTCGTAA CAAGG 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

CAAGGCATCC ACCGT 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
GAAGTCGTAA C 
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(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

AAGGCATCCA C 
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CLAIMS 

In the claims: 

5 

1. A method for the identification of the species 
of a microorganism comprising: 

(a) isolating genomic DNA from the 
microorganism, said genomic DNA comprising variable 

10 sequences interspersed between highly conserved 
sequences; 

(b) amplifying said variable sequences 
thereby producing fragments having particular 
distributions in size and number; and 

15 (c) separating said fragments by size and 

analyzing the distribution of said fragments thereby 
determining the species of the microorganism. 

2. The method of Claim 1 wherein said highly 
conserved sequences are highly conserved rDNA sequences, 

20 and said fragments are RS fragments. 

3. The method of Claim 2 wherein said variable 
sequences are spacer regions . 

4. The method of Claim 1 wherein step (a) 
isolation comprises cellular lysis and deproteinization . 

25 5- The method of Claim 2 wherein step (b) 

comprises annealing at least one pair of oligonucleotide 
primers to said highly conserved rDNA sequences, and 
introducing a nucleic acid polymerase and nucleoside 
triphosphates to said primers, suitable to amplify said 

30 variable sequences, 

6. The method of Claim 5 wherein said amplifi- 
cation is performed at least twice. 

7. The method of Claim 5 wherein annealing occurs 
at a temperature of up to about 72°C and during a period 

35 of about 1-20 minutes. 
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8, The method of Claim 5 wherein annealing occurs 
at a temperature of up to about 55°C and during a period 
of about 1-20 minutes. 

9. The method of Claim 8 wherein annealing occurs 
5 during a period of about 2-10 minutes. 

10. The method of Claim 5 wherein said 
oligonucleotide primers are 15-17 bases in length. 

11 • The method of Claim 5 wherein the nucleic acid 
polymerase is a DNA polymerase and the nucleoside 
10 triphosphates are deoxyribonucleoside triphosphates. 

12. The method of Claim 11 wherein the DNA 
polymerase is thermostable. 

13. The method of Claim 2 wherein step (c) 
comprises resolving said RS fragments by electrophoresis 

15 and thereafter visualizing said.RS fragments. 

14. The method of Claim 13 wherein step (c) 
further comprises comparing the electrophoresed, 
visualized RS fragments to a database of microorganisms. 

15. The method of Claim 14 wherein the database 
20 contains fragment distributions which are characteristic 

for Listeria monocytogenes, Listeria innocua, Listeria 
ivanoviir Listeria welshimeri, Salmonella typhimurium, 
Salmonella enteritidis, Salmonella newport. Salmonella 
infantis, Staphylococcus aureus r Staphylococcus scluri, 

25 Staphylococcus saprophyticus r Staphylococcus warneri. 
Staphylococcus epidermidus, Escherichia coli, 
Escherichia blattae, Escherichia fergusonii, Escherichia 
hermaniir Escherichia vulneris r Citrobacter freundii, 
Citrobacter diversus, Enterobacter aerogenes, 

30 Enterobacter agglomerans, Enterobacter cloacae, Proteus 
mirabilis, Proteus vulgar!, and Yersinia enterocolltica * 

16. The method of Claim 13 wherein in step (c) 
visualization is conducted by staining said RS fragments 
with ethidium bromide or detecting the position of said 

35 RS fragments by spectrophotometry procedures. 



WO 93/11264 



PCT/US92/10217 



35 

17. The method of Claim 13 wherein in step (c) 
visualization is conducted by incorporating 
fluorescence-labeled deoxynucleotide during 
amplification (b) • 
5 18. The method of Claim 1 useful in determining 

species of bacteria. 

19. A method for the identification of the 
species f serotype and strain of a microorganism, 
comprising: 

10 (a) isolating genomic DNA from the 

microorganism, said genomic DNA comprising variable 
sequences interspersed between highly conserved 
sequences in addition to arbitrary regions along the 
genome thereof; 

15 (b) amplifying said variable sequences 

thereby producing fragments having particular 
distributions in size and number; 

(c) amplifying said arbitrary regions thereby 
producing AG fragments having particular distributions 

20 in size and number simultaneously with (b) ; and 

(d) separating said fragments and said AG 
fragments by size and analyzing the distribution of said 
fragments and said AG fragments thereby determining the 
species, serotype and strain of the microorganism. 

25 20. The method of Claim 19 wherein said highly 

conserved sequences are highly conserved rDNA sequences, 
and said fragments are RS fragments. 

21. The method of Claim 20 wherein said variable 
sequences are spacer regions. 

30 22. The method of Claim 19 wherein step (a) 

isolation comprises cellular lysis and deproteinization . 

23. The method of Claim 20 wherein steps (b) and 
(c) comprise annealing at least one pair of 
oligonucleotide primers to said highly conserved rDNA 

35 sequences and to a plurality of said arbitrary regions, 
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and introducing a nucleic acid polymerase and nucleoside 
triphosphates to said primers / suitable to amplify said 
variable sequences and said arbitrary regions. 

24. The method of Claim 23 wherein said 
5 amplification is performed at least twice. 

25. The method of Claim- 23 wherein annealing 
occurs at a temperature of up to about 43-4 6°C and 
during a period of about 1-10 minutes. 

26. The method of Claim 25 wherein annealing 
10 occurs during a period of about 5-10 minutes. 

27. The method of Claim 23 wherein said 
oligonucleotide primers are 10-12 bases in length. 

28. The method of Claim 23 wherein the nucleic 
acid polymerase is a DNA polymerase and the nucleoside 

15 triphosphates, are deoxyribonucleoside triphosphates. 

29. The method of Claim 28 wherein the DNA 
polymerase is thermostable. 

30. The method of Claim 20 wherein step (d) 
comprises resolving said RS fragments and said AG 

20 fragments by electrophoresis and thereafter visualizing 
said RS fragments and said AG fragments. 

31. The method of Claim 30 wherein step (d) 
further comprises comparing the electrophoresed, 
visualized RS fragments and AG fragments to a database 

25 of microorganisms. 

32. The method of Claim 31 wherein the database 
contains fragment distributions which are characteristic 
for species, serotype and strain of .Listeria 
monocytogenes t Listeria innocua, Listeria ivanovii, 

30 Listeria welshimeri, Salmonella typhimurium, Salmonella 
enteritidis, Salmonella newport, Salmonella infantis. 
Staphylococcus aureus, Staphylococcus sciuri, 
Staphylococcus saprophyticus, Staphylococcus vrarneri, 
Staphylococcus epidermidus, Escherichia coli r 

35 Escherichia blattae, Escherichia fergusonii r Escherichia 
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hermanil, Escherichia vulneris, Citrobacter freundii, 
Citrobacter diversus, Enterobacter aerogenes, 
Enterobacter agglomerans, Enterobacter cloacae, Proteus 
mirabilis, Proteus vulgar! , and Yersinia enterocolitica. 
5 33. The method of Claim 30 wherein in step (d) 

visualization is conducted by staining said RS fragments 
and said AG fragments with ethidium bromide or detecting 
the position of said RS fragments and said AG fragments 
by spectrophotometric procedures . 

10 34. The method of Claim 30 wherein in step (d) 

visualization is conducted by incorporating 
fluorescence-labeled deoxynucleotide during 
amplification (b) and (c) . 

35. The method of Claim 1 useful in determining 

15 species, serotype and strain of bacteria. 
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